Dataset statistics
| Number of variables | 23 |
|---|---|
| Number of observations | 9527 |
| Missing cells | 10496 |
| Missing cells (%) | 4.8% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 1.7 MiB |
| Average record size in memory | 184.0 B |
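The missing-cell percentage and the memory footprint can be cross-checked against the other rows of this table. The following is a minimal sketch of that arithmetic; the variable names are illustrative and not part of the report.

```python
# Figures copied from the "Dataset statistics" table.
n_variables = 23
n_observations = 9527
missing_cells = 10496

# Every observation has one cell per variable.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# The average record size is reported as 184.0 B, so the total footprint
# is roughly 184 B per row, converted here to MiB (2**20 bytes).
avg_record_bytes = 184.0
total_mib = avg_record_bytes * n_observations / 2**20

print(f"{total_cells} cells, {missing_pct:.1f}% missing, {total_mib:.1f} MiB")
```

Rounded to one decimal place, both derived values agree with the table.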
Variable types
| Categorical | 14 |
|---|---|
| Numeric | 9 |
ID has a high cardinality: 9527 distinct values | High cardinality |
Application_Receipt_Date has a high cardinality: 357 distinct values | High cardinality |
Applicant_BirthDate has a high cardinality: 5836 distinct values | High cardinality |
Manager_DOJ has a high cardinality: 646 distinct values | High cardinality |
Manager_DoB has a high cardinality: 1245 distinct values | High cardinality |
Office_PIN is highly correlated with Applicant_City_PIN | High correlation |
Applicant_City_PIN is highly correlated with Office_PIN | High correlation |
Manager_Num_Application is highly correlated with Manager_Num_Coded | High correlation |
Manager_Num_Coded is highly correlated with Manager_Num_Application | High correlation |
Manager_Business is highly correlated with Manager_Num_Products and 2 other fields | High correlation |
Manager_Num_Products is highly correlated with Manager_Business and 2 other fields | High correlation |
Manager_Business2 is highly correlated with Manager_Business and 2 other fields | High correlation |
Manager_Num_Products2 is highly correlated with Manager_Business and 2 other fields | High correlation |
Manager_Business2 is highly correlated with Manager_Business and 3 other fields | High correlation |
Manager_Business is highly correlated with Manager_Business2 and 3 other fields | High correlation |
Manager_Grade is highly correlated with Manager_Joining_Designation and 1 other fields | High correlation |
Manager_Joining_Designation is highly correlated with Manager_Grade and 2 other fields | High correlation |
Manager_Num_Products is highly correlated with Manager_Business2 and 3 other fields | High correlation |
Manager_Current_Designation is highly correlated with Manager_Grade and 1 other fields | High correlation |
Manager_Status is highly correlated with Manager_Business2 and 3 other fields | High correlation |
Manager_Num_Products2 is highly correlated with Manager_Business2 and 2 other fields | High correlation |
Manager_Joining_Designation is highly correlated with Manager_Current_Designation | High correlation |
Manager_Current_Designation is highly correlated with Manager_Joining_Designation | High correlation |
Applicant_City_PIN has 97 (1.0%) missing values | Missing |
Applicant_Occupation has 1221 (12.8%) missing values | Missing |
Manager_DOJ has 683 (7.2%) missing values | Missing |
Manager_Joining_Designation has 683 (7.2%) missing values | Missing |
Manager_Current_Designation has 683 (7.2%) missing values | Missing |
Manager_Grade has 683 (7.2%) missing values | Missing |
Manager_Status has 683 (7.2%) missing values | Missing |
Manager_Gender has 683 (7.2%) missing values | Missing |
Manager_DoB has 683 (7.2%) missing values | Missing |
Manager_Num_Application has 683 (7.2%) missing values | Missing |
Manager_Num_Coded has 683 (7.2%) missing values | Missing |
Manager_Business has 683 (7.2%) missing values | Missing |
Manager_Num_Products has 683 (7.2%) missing values | Missing |
Manager_Business2 has 683 (7.2%) missing values | Missing |
Manager_Num_Products2 has 683 (7.2%) missing values | Missing |
ID is uniformly distributed | Uniform |
Applicant_BirthDate is uniformly distributed | Uniform |
ID has unique values | Unique |
Manager_Num_Application has 2980 (31.3%) zeros | Zeros |
Manager_Num_Coded has 5283 (55.5%) zeros | Zeros |
Manager_Business has 2904 (30.5%) zeros | Zeros |
Manager_Num_Products has 2909 (30.5%) zeros | Zeros |
Manager_Business2 has 2909 (30.5%) zeros | Zeros |
Manager_Num_Products2 has 2914 (30.6%) zeros | Zeros |
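The "Missing" alerts above do not cover every column with missing values, so they do not add up to the 10496 missing cells in the dataset statistics on their own. As a rough reconciliation, the sketch below adds the four smaller counts taken from the per-variable sections of this report (Applicant_Gender, Applicant_BirthDate, Applicant_Marital_Status, Applicant_Qualification), none of which raised an alert. The dictionary layout is illustrative only.

```python
# Thirteen Manager_* columns each report 683 missing values in the alerts
# (Manager_DOJ through Manager_Num_Products2).
manager_columns_with_683_missing = 13

# Remaining columns with missing values. The first two appear in the alert
# list; the last four are read from the per-variable sections.
missing_by_column = {
    "Applicant_City_PIN": 97,
    "Applicant_Occupation": 1221,
    "Applicant_Gender": 67,
    "Applicant_BirthDate": 73,
    "Applicant_Marital_Status": 73,
    "Applicant_Qualification": 86,
}

total_missing = (
    683 * manager_columns_with_683_missing + sum(missing_by_column.values())
)
print(total_missing)
```

The total matches the "Missing cells" figure, which suggests the 683 rows missing manager data account for most of the gaps.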
Reproduction
| Analysis started | 2021-07-27 05:43:48.977680 |
|---|---|
| Analysis finished | 2021-07-27 05:44:17.593113 |
| Duration | 28.62 seconds |
| Software version | pandas-profiling v3.0.0 |
| Download configuration | config.json |
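The reported duration is simply the difference between the two timestamps. A small sketch with the standard library, assuming both timestamps are in the same time zone:

```python
from datetime import datetime

# Timestamps copied from the "Reproduction" table.
fmt = "%Y-%m-%d %H:%M:%S.%f"
started = datetime.strptime("2021-07-27 05:43:48.977680", fmt)
finished = datetime.strptime("2021-07-27 05:44:17.593113", fmt)

# Subtracting two datetimes yields a timedelta.
duration_s = (finished - started).total_seconds()
print(f"{duration_s:.2f} seconds")
```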
ID
Categorical
| Distinct | 9527 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 74.6 KiB |
| FIN1000297 | 1 |
|---|---|
| FIN1008090 | 1 |
| FIN1006786 | 1 |
| FIN1003695 | 1 |
| FIN1001949 | 1 |
| Other values (9522) |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
Characters and Unicode
| Total characters | 95270 |
|---|---|
| Distinct characters | 13 |
| Distinct categories | 2 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 9527 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | FIN1000001 |
|---|---|
| 2nd row | FIN1000002 |
| 3rd row | FIN1000003 |
| 4th row | FIN1000004 |
| 5th row | FIN1000005 |
Common Values
| Value | Count | Frequency (%) |
| FIN1000297 | 1 | < 0.1% |
| FIN1008090 | 1 | < 0.1% |
| FIN1006786 | 1 | < 0.1% |
| FIN1003695 | 1 | < 0.1% |
| FIN1001949 | 1 | < 0.1% |
| FIN1005284 | 1 | < 0.1% |
| FIN1000520 | 1 | < 0.1% |
| FIN1008952 | 1 | < 0.1% |
| FIN1004152 | 1 | < 0.1% |
| FIN1003429 | 1 | < 0.1% |
| Other values (9517) | 9517 |
Length
| Value | Count | Frequency (%) |
| fin1005842 | 1 | < 0.1% |
| fin1008863 | 1 | < 0.1% |
| fin1008343 | 1 | < 0.1% |
| fin1002299 | 1 | < 0.1% |
| fin1002138 | 1 | < 0.1% |
| fin1001809 | 1 | < 0.1% |
| fin1006553 | 1 | < 0.1% |
| fin1003237 | 1 | < 0.1% |
| fin1003408 | 1 | < 0.1% |
| fin1007831 | 1 | < 0.1% |
| Other values (9517) | 9517 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 22963 | |
| 1 | 13440 | |
| F | 9527 | |
| I | 9527 | |
| N | 9527 | |
| 2 | 3911 | 4.1% |
| 3 | 3903 | 4.1% |
| 4 | 3903 | 4.1% |
| 5 | 3831 | 4.0% |
| 6 | 3803 | 4.0% |
| Other values (3) | 10935 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 66689 | |
| Uppercase Letter | 28581 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 22963 | |
| 1 | 13440 | |
| 2 | 3911 | 5.9% |
| 3 | 3903 | 5.9% |
| 4 | 3903 | 5.9% |
| 5 | 3831 | 5.7% |
| 6 | 3803 | 5.7% |
| 7 | 3803 | 5.7% |
| 8 | 3802 | 5.7% |
| 9 | 3330 | 5.0% |
Uppercase Letter
| Value | Count | Frequency (%) |
| F | 9527 | |
| I | 9527 | |
| N | 9527 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 66689 | |
| Latin | 28581 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 22963 | |
| 1 | 13440 | |
| 2 | 3911 | 5.9% |
| 3 | 3903 | 5.9% |
| 4 | 3903 | 5.9% |
| 5 | 3831 | 5.7% |
| 6 | 3803 | 5.7% |
| 7 | 3803 | 5.7% |
| 8 | 3802 | 5.7% |
| 9 | 3330 | 5.0% |
Latin
| Value | Count | Frequency (%) |
| F | 9527 | |
| I | 9527 | |
| N | 9527 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 95270 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 22963 | |
| 1 | 13440 | |
| F | 9527 | |
| I | 9527 | |
| N | 9527 | |
| 2 | 3911 | 4.1% |
| 3 | 3903 | 4.1% |
| 4 | 3903 | 4.1% |
| 5 | 3831 | 4.0% |
| 6 | 3803 | 4.0% |
| Other values (3) | 10935 |
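The category names in the character tables above ("Uppercase Letter", "Decimal Number") correspond to Unicode general categories. The sketch below shows one way such a tally could be produced with Python's `unicodedata` module, using only the five sample IDs from this section. It is an illustration of the idea, not the profiler's own implementation.

```python
import unicodedata
from collections import Counter

# The five sample IDs shown in the "Sample" table.
sample_ids = ["FIN1000001", "FIN1000002", "FIN1000003", "FIN1000004", "FIN1000005"]

# Tally the Unicode general category of every character.
# "Lu" is Uppercase Letter and "Nd" is Decimal Number.
category_counts = Counter(
    unicodedata.category(ch) for value in sample_ids for ch in value
)

# Distinct characters seen in the sample only (the full column has 13).
distinct_chars = set("".join(sample_ids))

print(dict(category_counts), len(distinct_chars))
```

Each ID has 3 letters and 7 digits, which is consistent with the full-column counts: 3 × 9527 = 28581 uppercase letters and 7 × 9527 = 66689 decimal digits.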
Office_PIN
Real number (ℝ≥0)
| Distinct | 98 |
|---|---|
| Distinct (%) | 1.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 452894.3722 |
| Minimum | 110005 |
|---|---|
| Maximum | 851101 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 74.6 KiB |
Quantile statistics
| Minimum | 110005 |
|---|---|
| 5-th percentile | 122002 |
| Q1 | 226001 |
| median | 416001 |
| Q3 | 695014 |
| 95-th percentile | 841428 |
| Maximum | 851101 |
| Range | 741096 |
| Interquartile range (IQR) | 469013 |
Descriptive statistics
| Standard deviation | 235690.6183 |
|---|---|
| Coefficient of variation (CV) | 0.5204096865 |
| Kurtosis | -1.28405354 |
| Mean | 452894.3722 |
| Median Absolute Deviation (MAD) | 194991 |
| Skewness | 0.3027009114 |
| Sum | 4314724684 |
| Variance | 5.555006753 × 10^10 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 695014 | 397 | 4.2% |
| 211001 | 257 | 2.7% |
| 221010 | 249 | 2.6% |
| 121002 | 236 | 2.5% |
| 400075 | 216 | 2.3% |
| 700016 | 192 | 2.0% |
| 444601 | 187 | 2.0% |
| 201301 | 184 | 1.9% |
| 208001 | 180 | 1.9% |
| 226001 | 176 | 1.8% |
| Other values (88) | 7253 |
| Value | Count | Frequency (%) |
| 110005 | 146 | |
| 110034 | 3 | < 0.1% |
| 121002 | 236 | |
| 122002 | 98 | |
| 124001 | 10 | 0.1% |
| 125001 | 78 | 0.8% |
| 141001 | 78 | 0.8% |
| 143001 | 12 | 0.1% |
| 144001 | 2 | < 0.1% |
| 160017 | 66 | 0.7% |
| Value | Count | Frequency (%) |
| 851101 | 104 | |
| 848101 | 80 | |
| 843302 | 110 | |
| 842001 | 145 | |
| 841428 | 99 | |
| 841226 | 66 | |
| 834001 | 120 | |
| 826001 | 6 | 0.1% |
| 824101 | 71 | |
| 814112 | 84 |
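Several of the statistics for this numeric variable are derived from others in the same tables. The sketch below recomputes them from the reported figures as a sanity check; the variable names are illustrative, and small differences are expected because the inputs are rounded.

```python
# Figures copied from the quantile and descriptive statistics tables.
minimum, q1, q3, maximum = 110005, 226001, 695014, 851101
mean, std = 452894.3722, 235690.6183
total, n_rows = 4314724684, 9527  # "Sum" row and number of observations

value_range = maximum - minimum   # reported range: 741096
iqr = q3 - q1                     # reported IQR: 469013
cv = std / mean                   # reported CV: 0.5204096865
variance = std ** 2               # reported variance: about 5.555e10
mean_from_sum = total / n_rows    # reported mean: 452894.3722

print(value_range, iqr, round(cv, 6), f"{variance:.6e}", round(mean_from_sum, 4))
```

Because this column has no missing values, the sum is divided by all 9527 rows.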
Application_Receipt_Date
Categorical
| Distinct | 357 |
|---|---|
| Distinct (%) | 3.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 74.6 KiB |
| 5/9/2007 | 165 |
|---|---|
| 5/8/2007 | 97 |
| 4/18/2007 | 86 |
| 5/7/2007 | 86 |
| 1/2/2008 | 85 |
| Other values (352) |
Length
| Max length | 10 |
|---|---|
| Median length | 9 |
| Mean length | 8.931562926 |
| Min length | 8 |
Characters and Unicode
| Total characters | 85091 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 2 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 4/16/2007 |
|---|---|
| 2nd row | 4/16/2007 |
| 3rd row | 4/16/2007 |
| 4th row | 4/16/2007 |
| 5th row | 4/16/2007 |
Common Values
| Value | Count | Frequency (%) |
| 5/9/2007 | 165 | 1.7% |
| 5/8/2007 | 97 | 1.0% |
| 4/18/2007 | 86 | 0.9% |
| 5/7/2007 | 86 | 0.9% |
| 1/2/2008 | 85 | 0.9% |
| 11/12/2007 | 83 | 0.9% |
| 5/5/2008 | 80 | 0.8% |
| 4/16/2007 | 79 | 0.8% |
| 12/6/2007 | 73 | 0.8% |
| 11/19/2007 | 71 | 0.7% |
| Other values (347) | 8622 |
Length
| Value | Count | Frequency (%) |
| 5/9/2007 | 165 | 1.7% |
| 5/8/2007 | 97 | 1.0% |
| 4/18/2007 | 86 | 0.9% |
| 5/7/2007 | 86 | 0.9% |
| 1/2/2008 | 85 | 0.9% |
| 11/12/2007 | 83 | 0.9% |
| 5/5/2008 | 80 | 0.8% |
| 4/16/2007 | 79 | 0.8% |
| 12/6/2007 | 73 | 0.8% |
| 11/19/2007 | 71 | 0.7% |
| Other values (347) | 8622 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 20475 | |
| / | 19054 | |
| 2 | 14378 | |
| 1 | 8431 | |
| 7 | 8009 | 9.4% |
| 8 | 4900 | 5.8% |
| 5 | 2841 | 3.3% |
| 6 | 2221 | 2.6% |
| 4 | 1863 | 2.2% |
| 9 | 1585 | 1.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 66037 | |
| Other Punctuation | 19054 | 22.4% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 20475 | |
| 2 | 14378 | |
| 1 | 8431 | |
| 7 | 8009 | 12.1% |
| 8 | 4900 | 7.4% |
| 5 | 2841 | 4.3% |
| 6 | 2221 | 3.4% |
| 4 | 1863 | 2.8% |
| 9 | 1585 | 2.4% |
| 3 | 1334 | 2.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 19054 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 85091 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 20475 | |
| / | 19054 | |
| 2 | 14378 | |
| 1 | 8431 | |
| 7 | 8009 | 9.4% |
| 8 | 4900 | 5.8% |
| 5 | 2841 | 3.3% |
| 6 | 2221 | 2.6% |
| 4 | 1863 | 2.2% |
| 9 | 1585 | 1.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 85091 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 20475 | |
| / | 19054 | |
| 2 | 14378 | |
| 1 | 8431 | |
| 7 | 8009 | 9.4% |
| 8 | 4900 | 5.8% |
| 5 | 2841 | 3.3% |
| 6 | 2221 | 2.6% |
| 4 | 1863 | 2.2% |
| 9 | 1585 | 1.9% |
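The dates in this section are stored as strings, which is why the column is profiled as categorical with length and character statistics rather than as a date. A minimal sketch of parsing them follows. It assumes a month/day/year layout without zero padding, which is inferred from values such as 4/16/2007 and is not stated in the report.

```python
from datetime import datetime

# A few values copied from the tables in this section.
raw_dates = ["4/16/2007", "5/9/2007", "11/12/2007", "1/2/2008"]

# strptime accepts unpadded month and day numbers for %m and %d.
parsed = [datetime.strptime(value, "%m/%d/%Y").date() for value in raw_dates]

# String lengths vary with the number of digits in the month and day.
lengths = sorted({len(value) for value in raw_dates})

print(min(parsed), max(parsed), lengths)
```

The lengths of 8 to 10 characters match the "Min length" and "Max length" rows above.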
Applicant_City_PIN
Real number (ℝ≥0)
HIGH CORRELATION, MISSING
| Distinct | 2979 |
|---|---|
| Distinct (%) | 31.6% |
| Missing | 97 |
| Missing (%) | 1.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 456784.5473 |
| Minimum | 110001 |
|---|---|
| Maximum | 995657 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 74.6 KiB |
Quantile statistics
| Minimum | 110001 |
|---|---|
| 5-th percentile | 121102 |
| Q1 | 226020 |
| median | 422005.5 |
| Q3 | 695017 |
| 95-th percentile | 843121 |
| Maximum | 995657 |
| Range | 885656 |
| Interquartile range (IQR) | 468997 |
Descriptive statistics
| Standard deviation | 239291.0812 |
|---|---|
| Coefficient of variation (CV) | 0.5238598429 |
| Kurtosis | -1.302812578 |
| Mean | 456784.5473 |
| Median Absolute Deviation (MAD) | 209497 |
| Skewness | 0.2739133409 |
| Sum | 4307478281 |
| Variance | 5.726022155 × 10^10 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 202001 | 103 | 1.1% |
| 492001 | 75 | 0.8% |
| 305001 | 64 | 0.7% |
| 452001 | 55 | 0.6% |
| 476001 | 51 | 0.5% |
| 281001 | 49 | 0.5% |
| 125001 | 48 | 0.5% |
| 285001 | 47 | 0.5% |
| 803101 | 46 | 0.5% |
| 274001 | 46 | 0.5% |
| Other values (2969) | 8846 | |
| (Missing) | 97 | 1.0% |
| Value | Count | Frequency (%) |
| 110001 | 2 | |
| 110003 | 3 | |
| 110004 | 2 | |
| 110005 | 2 | |
| 110006 | 4 | |
| 110007 | 4 | |
| 110008 | 1 | < 0.1% |
| 110009 | 2 | |
| 110010 | 1 | < 0.1% |
| 110014 | 4 |
| Value | Count | Frequency (%) |
| 995657 | 1 | < 0.1% |
| 888620 | 1 | < 0.1% |
| 856127 | 1 | < 0.1% |
| 854101 | 1 | < 0.1% |
| 853204 | 11 | |
| 853202 | 3 | < 0.1% |
| 853201 | 7 | |
| 853102 | 1 | < 0.1% |
| 852111 | 1 | < 0.1% |
| 851231 | 1 | < 0.1% |
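With 97 missing values, the mean of Applicant_City_PIN is taken over the non-missing rows only. The following sketch checks this against the reported sum; the variable names are illustrative.

```python
# Figures copied from the tables for Applicant_City_PIN.
n_rows = 9527
n_missing = 97
total = 4307478281            # "Sum" row
reported_mean = 456784.5473   # "Mean" row

# Only non-missing values contribute to the mean.
n_valid = n_rows - n_missing
mean_excluding_missing = total / n_valid

# For comparison: dividing by all rows would understate the mean.
mean_over_all_rows = total / n_rows

print(n_valid, round(mean_excluding_missing, 4), round(mean_over_all_rows, 4))
```

The even number of non-missing values (9430) is also consistent with the half-integer median of 422005.5, which would be the average of the two middle values.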
Applicant_Gender
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 67 |
| Missing (%) | 0.7% |
| Memory size | 74.6 KiB |
| M | |
|---|---|
| F |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 9460 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | M |
|---|---|
| 2nd row | M |
| 3rd row | M |
| 4th row | M |
| 5th row | M |
Common Values
| Value | Count | Frequency (%) |
| M | 7179 | |
| F | 2281 | 23.9% |
| (Missing) | 67 | 0.7% |
Length
Pie chart
| Value | Count | Frequency (%) |
| m | 7179 | |
| f | 2281 | 24.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| M | 7179 | |
| F | 2281 | 24.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 9460 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 7179 | |
| F | 2281 | 24.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 9460 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| M | 7179 | |
| F | 2281 | 24.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 9460 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| M | 7179 | |
| F | 2281 | 24.1% |
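The share of "F" appears as 23.9% in the Common Values table and as 24.1% in the later tables. The difference comes from the denominator: the first includes the 67 missing rows, the others count only non-missing values. A short sketch of both calculations:

```python
# Counts copied from the "Common Values" table for Applicant_Gender.
counts = {"M": 7179, "F": 2281}
n_missing = 67

n_valid = sum(counts.values())   # non-missing values
n_rows = n_valid + n_missing     # all rows, including missing

# Share of "F" among all rows (Common Values table).
share_of_all_rows = 100 * counts["F"] / n_rows

# Share of "F" among non-missing values (word and character tables).
share_of_valid = 100 * counts["F"] / n_valid

print(f"{share_of_all_rows:.1f}% vs {share_of_valid:.1f}%")
```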
Applicant_BirthDate
Categorical
| Distinct | 5836 |
|---|---|
| Distinct (%) | 61.7% |
| Missing | 73 |
| Missing (%) | 0.8% |
| Memory size | 74.6 KiB |
| 1/3/1978 | 24 |
|---|---|
| 1/3/1980 | 20 |
| 1/2/1977 | 18 |
| 1/3/1979 | 16 |
| 1/3/1968 | 13 |
| Other values (5831) |
Length
| Max length | 10 |
|---|---|
| Median length | 9 |
| Mean length | 8.806854242 |
| Min length | 8 |
Characters and Unicode
| Total characters | 83260 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 3738 |
|---|---|
| Unique (%) | 39.5% |
Sample
| 1st row | 12/19/1971 |
|---|---|
| 2nd row | 2/17/1983 |
| 3rd row | 1/16/1966 |
| 4th row | 2/3/1988 |
| 5th row | 7/4/1985 |
Common Values
| Value | Count | Frequency (%) |
| 1/3/1978 | 24 | 0.3% |
| 1/3/1980 | 20 | 0.2% |
| 1/2/1977 | 18 | 0.2% |
| 1/3/1979 | 16 | 0.2% |
| 1/3/1968 | 13 | 0.1% |
| 1/3/1983 | 13 | 0.1% |
| 1/3/1976 | 13 | 0.1% |
| 1/2/1981 | 11 | 0.1% |
| 7/2/1980 | 10 | 0.1% |
| 1/3/1984 | 10 | 0.1% |
| Other values (5826) | 9306 | |
| (Missing) | 73 | 0.8% |
Length
| Value | Count | Frequency (%) |
| 1/3/1978 | 24 | 0.3% |
| 1/3/1980 | 20 | 0.2% |
| 1/2/1977 | 18 | 0.2% |
| 1/3/1979 | 16 | 0.2% |
| 1/3/1983 | 13 | 0.1% |
| 1/3/1968 | 13 | 0.1% |
| 1/3/1976 | 13 | 0.1% |
| 1/2/1981 | 11 | 0.1% |
| 7/3/1977 | 10 | 0.1% |
| 1/2/1973 | 10 | 0.1% |
| Other values (5826) | 9306 |
Most occurring characters
| Value | Count | Frequency (%) |
| / | 18908 | |
| 1 | 18022 | |
| 9 | 11626 | |
| 7 | 6754 | 8.1% |
| 8 | 6238 | 7.5% |
| 2 | 6091 | 7.3% |
| 6 | 4331 | 5.2% |
| 3 | 3302 | 4.0% |
| 5 | 3088 | 3.7% |
| 4 | 2742 | 3.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 64352 | |
| Other Punctuation | 18908 | 22.7% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 18022 | |
| 9 | 11626 | |
| 7 | 6754 | 10.5% |
| 8 | 6238 | 9.7% |
| 2 | 6091 | 9.5% |
| 6 | 4331 | 6.7% |
| 3 | 3302 | 5.1% |
| 5 | 3088 | 4.8% |
| 4 | 2742 | 4.3% |
| 0 | 2158 | 3.4% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 18908 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 83260 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| / | 18908 | |
| 1 | 18022 | |
| 9 | 11626 | |
| 7 | 6754 | 8.1% |
| 8 | 6238 | 7.5% |
| 2 | 6091 | 7.3% |
| 6 | 4331 | 5.2% |
| 3 | 3302 | 4.0% |
| 5 | 3088 | 3.7% |
| 4 | 2742 | 3.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 83260 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| / | 18908 | |
| 1 | 18022 | |
| 9 | 11626 | |
| 7 | 6754 | 8.1% |
| 8 | 6238 | 7.5% |
| 2 | 6091 | 7.3% |
| 6 | 4331 | 5.2% |
| 3 | 3302 | 4.0% |
| 5 | 3088 | 3.7% |
| 4 | 2742 | 3.3% |
Applicant_Marital_Status
Categorical
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 73 |
| Missing (%) | 0.8% |
| Memory size | 74.6 KiB |
| M | |
|---|---|
| S | |
| W | 6 |
| D | 4 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 9454 |
|---|---|
| Distinct characters | 4 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | M |
|---|---|
| 2nd row | S |
| 3rd row | M |
| 4th row | S |
| 5th row | M |
Common Values
| Value | Count | Frequency (%) |
| M | 6177 | |
| S | 3267 | |
| W | 6 | 0.1% |
| D | 4 | < 0.1% |
| (Missing) | 73 | 0.8% |
Length
Pie chart
| Value | Count | Frequency (%) |
| m | 6177 | |
| s | 3267 | |
| w | 6 | 0.1% |
| d | 4 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| M | 6177 | |
| S | 3267 | |
| W | 6 | 0.1% |
| D | 4 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 9454 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 6177 | |
| S | 3267 | |
| W | 6 | 0.1% |
| D | 4 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 9454 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| M | 6177 | |
| S | 3267 | |
| W | 6 | 0.1% |
| D | 4 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 9454 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| M | 6177 | |
| S | 3267 | |
| W | 6 | 0.1% |
| D | 4 | < 0.1% |
Applicant_Occupation
Categorical
| Distinct | 5 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 1221 |
| Missing (%) | 12.8% |
| Memory size | 74.6 KiB |
| Salaried | |
|---|---|
| Business | |
| Others | |
| Self Employed | 149 |
| Student | 101 |
Length
| Max length | 13 |
|---|---|
| Median length | 8 |
| Mean length | 7.604141584 |
| Min length | 6 |
Characters and Unicode
| Total characters | 63160 |
|---|---|
| Distinct characters | 21 |
| Distinct categories | 3 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Others |
|---|---|
| 2nd row | Others |
| 3rd row | Business |
| 4th row | Salaried |
| 5th row | Others |
Common Values
| Value | Count | Frequency (%) |
| Salaried | 3787 | |
| Business | 2303 | |
| Others | 1966 | |
| Self Employed | 149 | 1.6% |
| Student | 101 | 1.1% |
| (Missing) | 1221 | 12.8% |
Length
Pie chart
| Value | Count | Frequency (%) |
| salaried | 3787 | |
| business | 2303 | |
| others | 1966 | |
| self | 149 | 1.8% |
| employed | 149 | 1.8% |
| student | 101 | 1.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| s | 8875 | |
| e | 8455 | |
| a | 7574 | |
| i | 6090 | |
| r | 5753 | |
| l | 4085 | |
| S | 4037 | |
| d | 4037 | |
| u | 2404 | 3.8% |
| n | 2404 | 3.8% |
| Other values (11) | 9446 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 54556 | |
| Uppercase Letter | 8455 | 13.4% |
| Space Separator | 149 | 0.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| s | 8875 | |
| e | 8455 | |
| a | 7574 | |
| i | 6090 | |
| r | 5753 | |
| l | 4085 | |
| d | 4037 | |
| u | 2404 | 4.4% |
| n | 2404 | 4.4% |
| t | 2168 | 4.0% |
| Other values (6) | 2711 | 5.0% |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 4037 | |
| B | 2303 | |
| O | 1966 | |
| E | 149 | 1.8% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 149 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 63011 | |
| Common | 149 | 0.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| s | 8875 | |
| e | 8455 | |
| a | 7574 | |
| i | 6090 | |
| r | 5753 | |
| l | 4085 | |
| S | 4037 | |
| d | 4037 | |
| u | 2404 | 3.8% |
| n | 2404 | 3.8% |
| Other values (10) | 9297 |
Common
| Value | Count | Frequency (%) |
| (space) | 149 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 63160 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| s | 8875 | |
| e | 8455 | |
| a | 7574 | |
| i | 6090 | |
| r | 5753 | |
| l | 4085 | |
| S | 4037 | |
| d | 4037 | |
| u | 2404 | 3.8% |
| n | 2404 | 3.8% |
| Other values (11) | 9446 |
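The length and character totals for the occupation values above can be reproduced from the five category counts alone. This sketch is a worked check, not the profiler's code:

```python
# Counts copied from the "Common Values" table (missing rows excluded).
counts = {
    "Salaried": 3787,
    "Business": 2303,
    "Others": 1966,
    "Self Employed": 149,
    "Student": 101,
}

n_valid = sum(counts.values())

# Total characters: length of each label times its frequency.
total_characters = sum(len(value) * count for value, count in counts.items())
mean_length = total_characters / n_valid

# Only "Self Employed" contains a space.
space_count = sum(value.count(" ") * count for value, count in counts.items())

print(n_valid, total_characters, round(mean_length, 9), space_count)
```

The results line up with the reported total of 63160 characters, the mean length of 7.604141584, and the 149 space separators.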
Applicant_Qualification
Categorical
| Distinct | 11 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 86 |
| Missing (%) | 0.9% |
| Memory size | 74.6 KiB |
| Class XII | |
|---|---|
| Graduate | |
| Class X | 225 |
| Others | 132 |
| Masters of Business Administration | 74 |
| Other values (6) | 8 |
Length
| Max length | 64 |
|---|---|
| Median length | 9 |
| Mean length | 8.806694206 |
| Min length | 6 |
Characters and Unicode
| Total characters | 83144 |
|---|---|
| Distinct characters | 34 |
| Distinct categories | 4 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 5 |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | Graduate |
|---|---|
| 2nd row | Class XII |
| 3rd row | Class XII |
| 4th row | Class XII |
| 5th row | Class XII |
Common Values
| Value | Count | Frequency (%) |
| Class XII | 5806 | |
| Graduate | 3196 | |
| Class X | 225 | 2.4% |
| Others | 132 | 1.4% |
| Masters of Business Administration | 74 | 0.8% |
| Associate / Fellow of Institute of Chartered Accountans of India | 3 | < 0.1% |
| Professional Qualification in Marketing | 1 | < 0.1% |
| Associate/Fellow of Institute of Company Secretories of India | 1 | < 0.1% |
| Associate/Fellow of Insurance Institute of India | 1 | < 0.1% |
| Certified Associateship of Indian Institute of Bankers | 1 | < 0.1% |
| (Missing) | 86 | 0.9% |
Length
| Value | Count | Frequency (%) |
| class | 6031 | |
| xii | 5806 | |
| graduate | 3196 | |
| x | 225 | 1.4% |
| others | 132 | 0.8% |
| of | 92 | 0.6% |
| administration | 74 | 0.5% |
| business | 74 | 0.5% |
| masters | 74 | 0.5% |
| institute | 6 | < 0.1% |
| Other values (20) | 37 | 0.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| s | 12667 | |
| a | 12599 | |
| I | 11626 | |
| (space) | 6306 | |
| l | 6046 | |
| C | 6036 | |
| X | 6031 | |
| t | 3587 | 4.3% |
| e | 3511 | 4.2% |
| r | 3490 | 4.2% |
| Other values (24) | 11245 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 49566 | |
| Uppercase Letter | 27266 | |
| Space Separator | 6306 | 7.6% |
| Other Punctuation | 6 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| s | 12667 | |
| a | 12599 | |
| l | 6046 | |
| t | 3587 | 7.2% |
| e | 3511 | 7.1% |
| r | 3490 | 7.0% |
| u | 3282 | 6.6% |
| d | 3281 | 6.6% |
| i | 328 | 0.7% |
| n | 250 | 0.5% |
| Other values (10) | 525 | 1.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| I | 11626 | |
| C | 6036 | |
| X | 6031 | |
| G | 3196 | 11.7% |
| O | 132 | 0.5% |
| A | 85 | 0.3% |
| M | 75 | 0.3% |
| B | 75 | 0.3% |
| F | 6 | < 0.1% |
| S | 2 | < 0.1% |
| Other values (2) | 2 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 6306 |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 6 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 76832 | |
| Common | 6312 | 7.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| s | 12667 | |
| a | 12599 | |
| I | 11626 | |
| l | 6046 | |
| C | 6036 | |
| X | 6031 | |
| t | 3587 | 4.7% |
| e | 3511 | 4.6% |
| r | 3490 | 4.5% |
| u | 3282 | 4.3% |
| Other values (22) | 7957 |
Common
| Value | Count | Frequency (%) |
| (space) | 6306 | |
| / | 6 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 83144 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| s | 12667 | |
| a | 12599 | |
| I | 11626 | |
| (space) | 6306 | |
| l | 6046 | |
| C | 6036 | |
| X | 6031 | |
| t | 3587 | 4.3% |
| e | 3511 | 4.2% |
| r | 3490 | 4.2% |
| Other values (24) | 11245 |
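The lowercase word table in this section (class, xii, graduate, and so on) looks like a token count over the qualification strings. The sketch below reproduces the larger counts from the five most common values, assuming tokens are obtained by lowercasing and splitting on whitespace. That tokenisation rule is an assumption, not something the report states.

```python
from collections import Counter

# The five most common values and their counts from "Common Values".
top_values = {
    "Class XII": 5806,
    "Graduate": 3196,
    "Class X": 225,
    "Others": 132,
    "Masters of Business Administration": 74,
}

# Weight each token by how often its source value occurs.
word_counts = Counter()
for value, count in top_values.items():
    for word in value.lower().split():
        word_counts[word] += count

print(word_counts.most_common(4))
```

Here "class" totals 5806 + 225 = 6031, as in the table. The count for "of" comes out at 74 rather than the reported 92 because the rarer qualifications, which also contain "of", are left out of this sketch.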
Manager_DOJ
Categorical
| Distinct | 646 |
|---|---|
| Distinct (%) | 7.3% |
| Missing | 683 |
| Missing (%) | 7.2% |
| Memory size | 74.6 KiB |
| 7/9/2007 | 106 |
|---|---|
| 6/11/2007 | 76 |
| 11/6/2007 | 71 |
| 5/8/2006 | 69 |
| 12/3/2007 | 67 |
| Other values (641) |
Length
| Max length | 10 |
|---|---|
| Median length | 9 |
| Mean length | 8.968905473 |
| Min length | 8 |
Characters and Unicode
| Total characters | 79321 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 49 |
|---|---|
| Unique (%) | 0.6% |
Sample
| 1st row | 11/10/2005 |
|---|---|
| 2nd row | 11/10/2005 |
| 3rd row | 5/27/2006 |
| 4th row | 8/21/2003 |
| 5th row | 5/8/2006 |
Common Values
| Value | Count | Frequency (%) |
| 7/9/2007 | 106 | 1.1% |
| 6/11/2007 | 76 | 0.8% |
| 11/6/2007 | 71 | 0.7% |
| 5/8/2006 | 69 | 0.7% |
| 12/3/2007 | 67 | 0.7% |
| 7/28/2003 | 67 | 0.7% |
| 5/11/2006 | 67 | 0.7% |
| 6/25/2007 | 64 | 0.7% |
| 8/20/2007 | 63 | 0.7% |
| 7/3/2007 | 54 | 0.6% |
| Other values (636) | 8140 | |
| (Missing) | 683 | 7.2% |
Length
| Value | Count | Frequency (%) |
| 7/9/2007 | 106 | 1.2% |
| 6/11/2007 | 76 | 0.9% |
| 11/6/2007 | 71 | 0.8% |
| 5/8/2006 | 69 | 0.8% |
| 5/11/2006 | 67 | 0.8% |
| 7/28/2003 | 67 | 0.8% |
| 12/3/2007 | 67 | 0.8% |
| 6/25/2007 | 64 | 0.7% |
| 8/20/2007 | 63 | 0.7% |
| 7/3/2007 | 54 | 0.6% |
| Other values (636) | 8140 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 19423 | |
| / | 17688 | |
| 2 | 13727 | |
| 1 | 8374 | |
| 7 | 5223 | 6.6% |
| 6 | 3936 | 5.0% |
| 5 | 2595 | 3.3% |
| 8 | 2518 | 3.2% |
| 3 | 2485 | 3.1% |
| 4 | 1855 | 2.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 61633 | |
| Other Punctuation | 17688 | 22.3% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 19423 | |
| 2 | 13727 | |
| 1 | 8374 | |
| 7 | 5223 | 8.5% |
| 6 | 3936 | 6.4% |
| 5 | 2595 | 4.2% |
| 8 | 2518 | 4.1% |
| 3 | 2485 | 4.0% |
| 4 | 1855 | 3.0% |
| 9 | 1497 | 2.4% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 17688 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 79321 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 19423 | |
| / | 17688 | |
| 2 | 13727 | |
| 1 | 8374 | |
| 7 | 5223 | 6.6% |
| 6 | 3936 | 5.0% |
| 5 | 2595 | 3.3% |
| 8 | 2518 | 3.2% |
| 3 | 2485 | 3.1% |
| 4 | 1855 | 2.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 79321 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 19423 | |
| / | 17688 | |
| 2 | 13727 | |
| 1 | 8374 | |
| 7 | 5223 | 6.6% |
| 6 | 3936 | 5.0% |
| 5 | 2595 | 3.3% |
| 8 | 2518 | 3.2% |
| 3 | 2485 | 3.1% |
| 4 | 1855 | 2.3% |
Manager_Joining_Designation
Categorical
| Distinct | 8 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 683 |
| Missing (%) | 7.2% |
| Memory size | 74.6 KiB |
| Level 1 | |
|---|---|
| Level 2 | |
| Level 3 | |
| Level 4 | 200 |
| Other | 58 |
| Other values (3) | 21 |
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 6.986883763 |
| Min length | 5 |
Characters and Unicode
| Total characters | 61792 |
|---|---|
| Distinct characters | 16 |
| Distinct categories | 4 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Level 1 |
|---|---|
| 2nd row | Level 1 |
| 3rd row | Level 1 |
| 4th row | Level 1 |
| 5th row | Level 1 |
Common Values
| Value | Count | Frequency (%) |
| Level 1 | 4632 | |
| Level 2 | 2787 | |
| Level 3 | 1146 | 12.0% |
| Level 4 | 200 | 2.1% |
| Other | 58 | 0.6% |
| Level 6 | 18 | 0.2% |
| Level 7 | 2 | < 0.1% |
| Level 5 | 1 | < 0.1% |
| (Missing) | 683 | 7.2% |
Length
Pie chart
| Value | Count | Frequency (%) |
| level | 8786 | |
| 1 | 4632 | |
| 2 | 2787 | 15.8% |
| 3 | 1146 | 6.5% |
| 4 | 200 | 1.1% |
| other | 58 | 0.3% |
| 6 | 18 | 0.1% |
| 7 | 2 | < 0.1% |
| 5 | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 17630 | |
| L | 8786 | |
| v | 8786 | |
| l | 8786 | |
| (space) | 8786 | |
| 1 | 4632 | 7.5% |
| 2 | 2787 | 4.5% |
| 3 | 1146 | 1.9% |
| 4 | 200 | 0.3% |
| O | 58 | 0.1% |
| Other values (6) | 195 | 0.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 35376 | |
| Uppercase Letter | 8844 | 14.3% |
| Space Separator | 8786 | 14.2% |
| Decimal Number | 8786 | 14.2% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 4632 | |
| 2 | 2787 | |
| 3 | 1146 | 13.0% |
| 4 | 200 | 2.3% |
| 6 | 18 | 0.2% |
| 7 | 2 | < 0.1% |
| 5 | 1 | < 0.1% |
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 17630 | |
| v | 8786 | |
| l | 8786 | |
| t | 58 | 0.2% |
| h | 58 | 0.2% |
| r | 58 | 0.2% |
Uppercase Letter
| Value | Count | Frequency (%) |
| L | 8786 | |
| O | 58 | 0.7% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 8786 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 44220 | |
| Common | 17572 | 28.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 17630 | |
| L | 8786 | |
| v | 8786 | |
| l | 8786 | |
| O | 58 | 0.1% |
| t | 58 | 0.1% |
| h | 58 | 0.1% |
| r | 58 | 0.1% |
Common
| Value | Count | Frequency (%) |
| (space) | 8786 | |
| 1 | 4632 | |
| 2 | 2787 | 15.9% |
| 3 | 1146 | 6.5% |
| 4 | 200 | 1.1% |
| 6 | 18 | 0.1% |
| 7 | 2 | < 0.1% |
| 5 | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 61792 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 17630 | |
| L | 8786 | |
| v | 8786 | |
| l | 8786 | |
| (space) | 8786 | |
| 1 | 4632 | 7.5% |
| 2 | 2787 | 4.5% |
| 3 | 1146 | 1.9% |
| 4 | 200 | 0.3% |
| O | 58 | 0.1% |
| Other values (6) | 195 | 0.3% |
Manager_Current_Designation
Categorical
| Distinct | 5 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 683 |
| Missing (%) | 7.2% |
| Memory size | 74.6 KiB |
| Level 2 | |
|---|---|
| Level 1 | |
| Level 3 | |
| Level 4 | |
| Level 5 | 93 |
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 7 |
| Min length | 7 |
Characters and Unicode
| Total characters | 61908 |
|---|---|
| Distinct characters | 10 |
| Distinct categories | 4 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Level 2 |
|---|---|
| 2nd row | Level 2 |
| 3rd row | Level 1 |
| 4th row | Level 3 |
| 5th row | Level 1 |
Common Values
| Value | Count | Frequency (%) |
| Level 2 | 3208 | |
| Level 1 | 2479 | |
| Level 3 | 2033 | |
| Level 4 | 1031 | 10.8% |
| Level 5 | 93 | 1.0% |
| (Missing) | 683 | 7.2% |
[Figures: Length; Pie chart]
| Value | Count | Frequency (%) |
| level | 8844 | |
| 2 | 3208 | 18.1% |
| 1 | 2479 | 14.0% |
| 3 | 2033 | 11.5% |
| 4 | 1031 | 5.8% |
| 5 | 93 | 0.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 17688 | |
| L | 8844 | |
| v | 8844 | |
| l | 8844 | |
| (space) | 8844 | |
| 2 | 3208 | 5.2% |
| 1 | 2479 | 4.0% |
| 3 | 2033 | 3.3% |
| 4 | 1031 | 1.7% |
| 5 | 93 | 0.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 35376 | |
| Uppercase Letter | 8844 | 14.3% |
| Space Separator | 8844 | 14.3% |
| Decimal Number | 8844 | 14.3% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 3208 | |
| 1 | 2479 | |
| 3 | 2033 | |
| 4 | 1031 | 11.7% |
| 5 | 93 | 1.1% |
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 17688 | |
| v | 8844 | |
| l | 8844 |
Uppercase Letter
| Value | Count | Frequency (%) |
| L | 8844 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 8844 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 44220 | |
| Common | 17688 | 28.6% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| (space) | 8844 | |
| 2 | 3208 | 18.1% |
| 1 | 2479 | 14.0% |
| 3 | 2033 | 11.5% |
| 4 | 1031 | 5.8% |
| 5 | 93 | 0.5% |
Latin
| Value | Count | Frequency (%) |
| e | 17688 | |
| L | 8844 | |
| v | 8844 | |
| l | 8844 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 61908 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 17688 | |
| L | 8844 | |
| v | 8844 | |
| l | 8844 | |
| (space) | 8844 | |
| 2 | 3208 | 5.2% |
| 1 | 2479 | 4.0% |
| 3 | 2033 | 3.3% |
| 4 | 1031 | 1.7% |
| 5 | 93 | 0.2% |
Manager_Grade
Real number (ℝ≥0)
| Distinct | 10 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 683 |
| Missing (%) | 7.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.264133876 |
| Minimum | 1 |
|---|---|
| Maximum | 10 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 74.6 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 2 |
| median | 3 |
| Q3 | 4 |
| 95-th percentile | 6 |
| Maximum | 10 |
| Range | 9 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.137448815 |
|---|---|
| Coefficient of variation (CV) | 0.3484688001 |
| Kurtosis | 1.40160972 |
| Mean | 3.264133876 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 0.9952793268 |
| Sum | 28868 |
| Variance | 1.293789807 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 3 | 3207 | |
| 2 | 2471 | |
| 4 | 2038 | |
| 5 | 666 | 7.0% |
| 6 | 406 | 4.3% |
| 7 | 22 | 0.2% |
| 8 | 14 | 0.1% |
| 1 | 8 | 0.1% |
| 9 | 7 | 0.1% |
| 10 | 5 | 0.1% |
| (Missing) | 683 | 7.2% |
| Value | Count | Frequency (%) |
| 1 | 8 | 0.1% |
| 2 | 2471 | |
| 3 | 3207 | |
| 4 | 2038 | |
| 5 | 666 | 7.0% |
| 6 | 406 | 4.3% |
| 7 | 22 | 0.2% |
| 8 | 14 | 0.1% |
| 9 | 7 | 0.1% |
| 10 | 5 | 0.1% |
| Value | Count | Frequency (%) |
| 10 | 5 | 0.1% |
| 9 | 7 | 0.1% |
| 8 | 14 | 0.1% |
| 7 | 22 | 0.2% |
| 6 | 406 | 4.3% |
| 5 | 666 | 7.0% |
| 4 | 2038 | |
| 3 | 3207 | |
| 2 | 2471 | |
| 1 | 8 | 0.1% |
Manager_Status
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 683 |
| Missing (%) | 7.2% |
| Memory size | 74.6 KiB |
| Confirmation | |
|---|---|
| Probation |
Length
| Max length | 12 |
|---|---|
| Median length | 12 |
| Mean length | 10.79002714 |
| Min length | 9 |
Characters and Unicode
| Total characters | 95427 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Confirmation |
|---|---|
| 2nd row | Confirmation |
| 3rd row | Confirmation |
| 4th row | Confirmation |
| 5th row | Confirmation |
Common Values
| Value | Count | Frequency (%) |
| Confirmation | 5277 | |
| Probation | 3567 | |
| (Missing) | 683 | 7.2% |
[Figures: Length; Pie chart]
| Value | Count | Frequency (%) |
| confirmation | 5277 | |
| probation | 3567 |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 17688 | |
| n | 14121 | |
| i | 14121 | |
| r | 8844 | |
| a | 8844 | |
| t | 8844 | |
| C | 5277 | 5.5% |
| f | 5277 | 5.5% |
| m | 5277 | 5.5% |
| P | 3567 | 3.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 86583 | |
| Uppercase Letter | 8844 | 9.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 17688 | |
| n | 14121 | |
| i | 14121 | |
| r | 8844 | |
| a | 8844 | |
| t | 8844 | |
| f | 5277 | 6.1% |
| m | 5277 | 6.1% |
| b | 3567 | 4.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 5277 | |
| P | 3567 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 95427 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 17688 | |
| n | 14121 | |
| i | 14121 | |
| r | 8844 | |
| a | 8844 | |
| t | 8844 | |
| C | 5277 | 5.5% |
| f | 5277 | 5.5% |
| m | 5277 | 5.5% |
| P | 3567 | 3.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 95427 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| o | 17688 | |
| n | 14121 | |
| i | 14121 | |
| r | 8844 | |
| a | 8844 | |
| t | 8844 | |
| C | 5277 | 5.5% |
| f | 5277 | 5.5% |
| m | 5277 | 5.5% |
| P | 3567 | 3.7% |
Manager_Gender
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 683 |
| Missing (%) | 7.2% |
| Memory size | 74.6 KiB |
| M | |
|---|---|
| F |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 8844 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | M |
|---|---|
| 2nd row | M |
| 3rd row | M |
| 4th row | F |
| 5th row | M |
Common Values
| Value | Count | Frequency (%) |
| M | 7627 | |
| F | 1217 | 12.8% |
| (Missing) | 683 | 7.2% |
[Figures: Length; Pie chart]
| Value | Count | Frequency (%) |
| m | 7627 | |
| f | 1217 | 13.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| M | 7627 | |
| F | 1217 | 13.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 8844 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 7627 | |
| F | 1217 | 13.8% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 8844 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| M | 7627 | |
| F | 1217 | 13.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 8844 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| M | 7627 | |
| F | 1217 | 13.8% |
Manager_DoB
Categorical
| Distinct | 1245 |
|---|---|
| Distinct (%) | 14.1% |
| Missing | 683 |
| Missing (%) | 7.2% |
| Memory size | 74.6 KiB |
| 2/11/1961 | 45 |
|---|---|
| 1/7/1976 | 37 |
| 5/22/1974 | 30 |
| 2/7/1971 | 30 |
| 5/27/1955 | 29 |
| Other values (1240) |
Length
| Max length | 10 |
|---|---|
| Median length | 9 |
| Mean length | 8.810945274 |
| Min length | 8 |
Characters and Unicode
| Total characters | 77924 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 143 |
|---|---|
| Unique (%) | 1.6% |
Sample
| 1st row | 2/17/1978 |
|---|---|
| 2nd row | 2/17/1978 |
| 3rd row | 3/3/1969 |
| 4th row | 8/14/1978 |
| 5th row | 2/7/1971 |
Common Values
| Value | Count | Frequency (%) |
| 2/11/1961 | 45 | 0.5% |
| 1/7/1976 | 37 | 0.4% |
| 5/22/1974 | 30 | 0.3% |
| 2/7/1971 | 30 | 0.3% |
| 5/27/1955 | 29 | 0.3% |
| 7/2/1967 | 28 | 0.3% |
| 5/26/1974 | 28 | 0.3% |
| 10/25/1977 | 27 | 0.3% |
| 5/12/1974 | 27 | 0.3% |
| 9/16/1967 | 26 | 0.3% |
| Other values (1235) | 8537 | |
| (Missing) | 683 | 7.2% |
[Figure: Length]
| Value | Count | Frequency (%) |
| 2/11/1961 | 45 | 0.5% |
| 1/7/1976 | 37 | 0.4% |
| 2/7/1971 | 30 | 0.3% |
| 5/22/1974 | 30 | 0.3% |
| 5/27/1955 | 29 | 0.3% |
| 7/2/1967 | 28 | 0.3% |
| 5/26/1974 | 28 | 0.3% |
| 5/12/1974 | 27 | 0.3% |
| 10/25/1977 | 27 | 0.3% |
| 9/16/1967 | 26 | 0.3% |
| Other values (1235) | 8537 |
Most occurring characters
| Value | Count | Frequency (%) |
| / | 17688 | |
| 1 | 16941 | |
| 9 | 10922 | |
| 7 | 8480 | |
| 2 | 5441 | 7.0% |
| 6 | 4764 | 6.1% |
| 8 | 3495 | 4.5% |
| 3 | 2955 | 3.8% |
| 5 | 2692 | 3.5% |
| 4 | 2463 | 3.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 60236 | |
| Other Punctuation | 17688 | 22.7% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 16941 | |
| 9 | 10922 | |
| 7 | 8480 | |
| 2 | 5441 | 9.0% |
| 6 | 4764 | 7.9% |
| 8 | 3495 | 5.8% |
| 3 | 2955 | 4.9% |
| 5 | 2692 | 4.5% |
| 4 | 2463 | 4.1% |
| 0 | 2083 | 3.5% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 17688 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 77924 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| / | 17688 | |
| 1 | 16941 | |
| 9 | 10922 | |
| 7 | 8480 | |
| 2 | 5441 | 7.0% |
| 6 | 4764 | 6.1% |
| 8 | 3495 | 4.5% |
| 3 | 2955 | 3.8% |
| 5 | 2692 | 3.5% |
| 4 | 2463 | 3.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 77924 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| / | 17688 | |
| 1 | 16941 | |
| 9 | 10922 | |
| 7 | 8480 | |
| 2 | 5441 | 7.0% |
| 6 | 4764 | 6.1% |
| 8 | 3495 | 4.5% |
| 3 | 2955 | 3.8% |
| 5 | 2692 | 3.5% |
| 4 | 2463 | 3.2% |
Manager_Num_Application
Real number (ℝ≥0)
| Distinct | 17 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 683 |
| Missing (%) | 7.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.939733152 |
| Minimum | 0 |
|---|---|
| Maximum | 22 |
| Zeros | 2980 |
| Zeros (%) | 31.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 74.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 1 |
| Q3 | 3 |
| 95-th percentile | 6 |
| Maximum | 22 |
| Range | 22 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 2.150528806 |
|---|---|
| Coefficient of variation (CV) | 1.108672501 |
| Kurtosis | 3.68934091 |
| Mean | 1.939733152 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 1.53618874 |
| Sum | 17155 |
| Variance | 4.624774146 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 2980 | |
| 1 | 1677 | |
| 2 | 1339 | |
| 3 | 1073 | 11.3% |
| 4 | 710 | 7.5% |
| 5 | 458 | 4.8% |
| 6 | 270 | 2.8% |
| 7 | 132 | 1.4% |
| 8 | 84 | 0.9% |
| 9 | 63 | 0.7% |
| Other values (7) | 58 | 0.6% |
| (Missing) | 683 | 7.2% |
| Value | Count | Frequency (%) |
| 0 | 2980 | |
| 1 | 1677 | |
| 2 | 1339 | |
| 3 | 1073 | 11.3% |
| 4 | 710 | 7.5% |
| 5 | 458 | 4.8% |
| 6 | 270 | 2.8% |
| 7 | 132 | 1.4% |
| 8 | 84 | 0.9% |
| 9 | 63 | 0.7% |
| Value | Count | Frequency (%) |
| 22 | 1 | < 0.1% |
| 16 | 6 | 0.1% |
| 14 | 1 | < 0.1% |
| 13 | 4 | < 0.1% |
| 12 | 5 | 0.1% |
| 11 | 14 | 0.1% |
| 10 | 27 | 0.3% |
| 9 | 63 | |
| 8 | 84 | |
| 7 | 132 |
Manager_Num_Coded
Real number (ℝ≥0)
| Distinct | 10 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 683 |
| Missing (%) | 7.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.7589326097 |
| Minimum | 0 |
|---|---|
| Maximum | 9 |
| Zeros | 5283 |
| Zeros (%) | 55.5% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 74.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 1 |
| 95-th percentile | 3 |
| Maximum | 9 |
| Range | 9 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 1.188643743 |
|---|---|
| Coefficient of variation (CV) | 1.566204598 |
| Kurtosis | 4.693823949 |
| Mean | 0.7589326097 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 1.975780583 |
| Sum | 6712 |
| Variance | 1.412873948 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 5283 | |
| 1 | 1799 | 18.9% |
| 2 | 936 | 9.8% |
| 3 | 471 | 4.9% |
| 4 | 225 | 2.4% |
| 5 | 83 | 0.9% |
| 6 | 30 | 0.3% |
| 7 | 7 | 0.1% |
| 8 | 6 | 0.1% |
| 9 | 4 | < 0.1% |
| (Missing) | 683 | 7.2% |
| Value | Count | Frequency (%) |
| 0 | 5283 | |
| 1 | 1799 | 18.9% |
| 2 | 936 | 9.8% |
| 3 | 471 | 4.9% |
| 4 | 225 | 2.4% |
| 5 | 83 | 0.9% |
| 6 | 30 | 0.3% |
| 7 | 7 | 0.1% |
| 8 | 6 | 0.1% |
| 9 | 4 | < 0.1% |
| Value | Count | Frequency (%) |
| 9 | 4 | < 0.1% |
| 8 | 6 | 0.1% |
| 7 | 7 | 0.1% |
| 6 | 30 | 0.3% |
| 5 | 83 | 0.9% |
| 4 | 225 | 2.4% |
| 3 | 471 | 4.9% |
| 2 | 936 | 9.8% |
| 1 | 1799 | 18.9% |
| 0 | 5283 |
Manager_Business
Real number (ℝ)
HIGH CORRELATION, MISSING, ZEROS
| Distinct | 3747 |
|---|---|
| Distinct (%) | 42.4% |
| Missing | 683 |
| Missing (%) | 7.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 184370.9734 |
| Minimum | -265289 |
|---|---|
| Maximum | 3578265 |
| Zeros | 2904 |
| Zeros (%) | 30.5% |
| Negative | 14 |
| Negative (%) | 0.1% |
| Memory size | 74.6 KiB |
Quantile statistics
| Minimum | -265289 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 102178 |
| Q3 | 247116.5 |
| 95-th percentile | 692513 |
| Maximum | 3578265 |
| Range | 3843554 |
| Interquartile range (IQR) | 247116.5 |
Descriptive statistics
| Standard deviation | 274716.3231 |
|---|---|
| Coefficient of variation (CV) | 1.490019378 |
| Kurtosis | 18.15642379 |
| Mean | 184370.9734 |
| Median Absolute Deviation (MAD) | 102178 |
| Skewness | 3.366590283 |
| Sum | 1630576889 |
| Variance | 7.546905817 × 10¹⁰ |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 2904 | |
| 20000 | 34 | 0.4% |
| 50000 | 19 | 0.2% |
| 25000 | 14 | 0.1% |
| 200000 | 10 | 0.1% |
| 30000 | 10 | 0.1% |
| 307003 | 10 | 0.1% |
| 302911 | 9 | 0.1% |
| 574520 | 8 | 0.1% |
| 52900 | 8 | 0.1% |
| Other values (3737) | 5818 | |
| (Missing) | 683 | 7.2% |
| Value | Count | Frequency (%) |
| -265289 | 3 | < 0.1% |
| -250757 | 1 | < 0.1% |
| -74587 | 4 | < 0.1% |
| -28419 | 1 | < 0.1% |
| -25889 | 1 | < 0.1% |
| -18472 | 1 | < 0.1% |
| -2408 | 1 | < 0.1% |
| -212 | 2 | < 0.1% |
| 0 | 2904 | |
| 466 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 3578265 | 1 | |
| 3423740 | 1 | |
| 2792396 | 2 | |
| 2599295 | 1 | |
| 2413440 | 2 | |
| 2403457 | 2 | |
| 2306195 | 1 | |
| 2148532 | 1 | |
| 2105092 | 1 | |
| 2068335 | 1 |
Manager_Num_Products
Real number (ℝ≥0)
HIGH CORRELATION, MISSING, ZEROS
| Distinct | 57 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 683 |
| Missing (%) | 7.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 7.152306649 |
| Minimum | 0 |
|---|---|
| Maximum | 101 |
| Zeros | 2909 |
| Zeros (%) | 30.5% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 74.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 5 |
| Q3 | 11 |
| 95-th percentile | 23 |
| Maximum | 101 |
| Range | 101 |
| Interquartile range (IQR) | 11 |
Descriptive statistics
| Standard deviation | 8.439350937 |
|---|---|
| Coefficient of variation (CV) | 1.179948141 |
| Kurtosis | 8.452121926 |
| Mean | 7.152306649 |
| Median Absolute Deviation (MAD) | 5 |
| Skewness | 2.053397802 |
| Sum | 63255 |
| Variance | 71.22264423 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 2909 | |
| 6 | 432 | 4.5% |
| 5 | 426 | 4.5% |
| 4 | 398 | 4.2% |
| 7 | 384 | 4.0% |
| 8 | 359 | 3.8% |
| 9 | 331 | 3.5% |
| 3 | 313 | 3.3% |
| 11 | 310 | 3.3% |
| 1 | 292 | 3.1% |
| Other values (47) | 2690 | |
| (Missing) | 683 | 7.2% |
| Value | Count | Frequency (%) |
| 0 | 2909 | |
| 1 | 292 | 3.1% |
| 2 | 288 | 3.0% |
| 3 | 313 | 3.3% |
| 4 | 398 | 4.2% |
| 5 | 426 | 4.5% |
| 6 | 432 | 4.5% |
| 7 | 384 | 4.0% |
| 8 | 359 | 3.8% |
| 9 | 331 | 3.5% |
| Value | Count | Frequency (%) |
| 101 | 2 | < 0.1% |
| 74 | 2 | < 0.1% |
| 66 | 4 | |
| 61 | 1 | < 0.1% |
| 60 | 5 | |
| 59 | 1 | < 0.1% |
| 53 | 2 | < 0.1% |
| 51 | 5 | |
| 48 | 1 | < 0.1% |
| 47 | 4 |
Manager_Business2
Real number (ℝ)
HIGH CORRELATION, MISSING, ZEROS
| Distinct | 3743 |
|---|---|
| Distinct (%) | 42.3% |
| Missing | 683 |
| Missing (%) | 7.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 182926.344 |
| Minimum | -265289 |
|---|---|
| Maximum | 3578265 |
| Zeros | 2909 |
| Zeros (%) | 30.5% |
| Negative | 14 |
| Negative (%) | 0.1% |
| Memory size | 74.6 KiB |
Quantile statistics
| Minimum | -265289 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 101714 |
| Q3 | 246461.25 |
| 95-th percentile | 676045.85 |
| Maximum | 3578265 |
| Range | 3843554 |
| Interquartile range (IQR) | 246461.25 |
Descriptive statistics
| Standard deviation | 271802.1459 |
|---|---|
| Coefficient of variation (CV) | 1.485855673 |
| Kurtosis | 18.58394594 |
| Mean | 182926.344 |
| Median Absolute Deviation (MAD) | 101714 |
| Skewness | 3.382484489 |
| Sum | 1617800586 |
| Variance | 7.387640651 × 10¹⁰ |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 2909 | |
| 20000 | 34 | 0.4% |
| 50000 | 19 | 0.2% |
| 25000 | 14 | 0.1% |
| 30000 | 10 | 0.1% |
| 200000 | 10 | 0.1% |
| 307003 | 10 | 0.1% |
| 302911 | 9 | 0.1% |
| 714000 | 8 | 0.1% |
| 11640 | 8 | 0.1% |
| Other values (3733) | 5813 | |
| (Missing) | 683 | 7.2% |
| Value | Count | Frequency (%) |
| -265289 | 3 | < 0.1% |
| -250757 | 1 | < 0.1% |
| -74587 | 4 | < 0.1% |
| -28419 | 1 | < 0.1% |
| -25889 | 1 | < 0.1% |
| -18472 | 1 | < 0.1% |
| -2408 | 1 | < 0.1% |
| -212 | 2 | < 0.1% |
| 0 | 2909 | |
| 466 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 3578265 | 1 | |
| 3423740 | 1 | |
| 2792396 | 2 | |
| 2599295 | 1 | |
| 2413440 | 2 | |
| 2403457 | 2 | |
| 2306195 | 1 | |
| 2148532 | 1 | |
| 2105092 | 1 | |
| 2068335 | 1 |
Manager_Num_Products2
Real number (ℝ≥0)
HIGH CORRELATION, MISSING, ZEROS
| Distinct | 57 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 683 |
| Missing (%) | 7.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 7.131275441 |
| Minimum | 0 |
|---|---|
| Maximum | 101 |
| Zeros | 2914 |
| Zeros (%) | 30.6% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 74.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 5 |
| Q3 | 11 |
| 95-th percentile | 23 |
| Maximum | 101 |
| Range | 101 |
| Interquartile range (IQR) | 11 |
Descriptive statistics
| Standard deviation | 8.42359672 |
|---|---|
| Coefficient of variation (CV) | 1.181218814 |
| Kurtosis | 8.552072166 |
| Mean | 7.131275441 |
| Median Absolute Deviation (MAD) | 5 |
| Skewness | 2.06428762 |
| Sum | 63069 |
| Variance | 70.9569817 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 2914 | |
| 6 | 435 | 4.6% |
| 5 | 424 | 4.5% |
| 4 | 405 | 4.3% |
| 7 | 385 | 4.0% |
| 8 | 355 | 3.7% |
| 9 | 331 | 3.5% |
| 3 | 314 | 3.3% |
| 11 | 311 | 3.3% |
| 1 | 290 | 3.0% |
| Other values (47) | 2680 | |
| (Missing) | 683 | 7.2% |
| Value | Count | Frequency (%) |
| 0 | 2914 | |
| 1 | 290 | 3.0% |
| 2 | 287 | 3.0% |
| 3 | 314 | 3.3% |
| 4 | 405 | 4.3% |
| 5 | 424 | 4.5% |
| 6 | 435 | 4.6% |
| 7 | 385 | 4.0% |
| 8 | 355 | 3.7% |
| 9 | 331 | 3.5% |
| Value | Count | Frequency (%) |
| 101 | 2 | < 0.1% |
| 74 | 2 | < 0.1% |
| 66 | 4 | |
| 61 | 1 | < 0.1% |
| 60 | 5 | |
| 59 | 1 | < 0.1% |
| 53 | 2 | < 0.1% |
| 51 | 5 | |
| 48 | 1 | < 0.1% |
| 47 | 4 |
Business_Sourced
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 74.6 KiB |
| 0 | |
|---|---|
| 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 9527 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 1 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 6260 | |
| 1 | 3267 |
[Figures: Length; Pie chart]
| Value | Count | Frequency (%) |
| 0 | 6260 | |
| 1 | 3267 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 6260 | |
| 1 | 3267 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 9527 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 6260 | |
| 1 | 3267 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 9527 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 6260 | |
| 1 | 3267 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 9527 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 6260 | |
| 1 | 3267 |
Pearson's r
Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. Its value lies between -1 and +1: -1 indicates total negative linear correlation, 0 indicates no linear correlation, and +1 indicates total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear relationship the steepness of the line does not affect r (only the sign of the slope does). To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
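The calculation described above can be sketched in a few lines of plain Python. This is a minimal illustration, not the code the report generator uses; the function name `pearson_r` is hypothetical, and the demo values are only the ten `Manager_Num_Application` and `Manager_Num_Coded` entries visible in the "First rows" table, so the result is not the correlation over the full dataset.

```python
import math


def pearson_r(x, y):
    """Covariance of x and y divided by the product of their standard deviations.

    The 1/n (or 1/(n-1)) factors cancel between numerator and denominator,
    so plain sums of centred products are enough.
    """
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    spread_x = math.sqrt(sum((a - mean_x) ** 2 for a in x))
    spread_y = math.sqrt(sum((b - mean_y) ** 2 for b in y))
    return cov / (spread_x * spread_y)


if __name__ == "__main__":
    # Illustrative values taken from the first ten rows shown in the report.
    num_application = [2, 2, 0, 0, 2, 0, 0, 5, 0, 0]
    num_coded = [1, 1, 0, 0, 1, 0, 0, 4, 0, 0]
    print(pearson_r(num_application, num_coded))
```

Rescaling or shifting either input (for example `10 * a + 3`) leaves the result unchanged, which is the location and scale invariance mentioned above.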
Spearman's ρ
Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better at catching nonlinear monotonic correlations than Pearson's r. Its value lies between -1 and +1: -1 indicates total negative monotonic correlation, 0 indicates no monotonic correlation, and +1 indicates total positive monotonic correlation. To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
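As a rough sketch of that definition, the snippet below ranks each variable (giving tied values their average rank, which is an assumption about tie handling) and then applies the Pearson formula to the ranks. The helper names `average_ranks`, `pearson_r` and `spearman_rho` are hypothetical and chosen for illustration only.

```python
import math


def average_ranks(values):
    """Return 1-based ranks; tied values share the average of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j to the end of the run of equal values.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        shared_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = shared_rank
        i = j + 1
    return ranks


def pearson_r(x, y):
    """Covariance divided by the product of standard deviations."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    spread_x = math.sqrt(sum((a - mean_x) ** 2 for a in x))
    spread_y = math.sqrt(sum((b - mean_y) ** 2 for b in y))
    return cov / (spread_x * spread_y)


def spearman_rho(x, y):
    """Pearson correlation computed on the rank variables of x and y."""
    return pearson_r(average_ranks(x), average_ranks(y))
```

For a strictly increasing but nonlinear relationship such as y = x³, `spearman_rho` returns 1 (up to floating-point error) while `pearson_r` on the raw values stays below 1, which is the sense in which ρ catches nonlinear monotonic correlation.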
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1: -1 indicates total negative correlation, 0 indicates no correlation, and +1 indicates total positive correlation. To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
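The pair-counting definition above corresponds to the variant often called τ-a. A minimal sketch follows, with the hypothetical function name `kendall_tau_a`; tied pairs count as neither concordant nor discordant here, whereas tie-adjusted variants such as τ-b use a different denominator, so results on data with many ties may differ from what a statistics library reports.

```python
from itertools import combinations


def kendall_tau_a(x, y):
    """(concordant - discordant) / total number of pairs, with no tie adjustment."""
    concordant = 0
    discordant = 0
    pairs = list(combinations(range(len(x)), 2))
    for i, j in pairs:
        # Same sign of the differences means the pair is ordered the same way
        # in both variables (concordant); opposite signs mean discordant.
        direction = (x[i] - x[j]) * (y[i] - y[j])
        if direction > 0:
            concordant += 1
        elif direction < 0:
            discordant += 1
    return (concordant - discordant) / len(pairs)
```

For example, with x = [1, 2, 3, 4] and y = [1, 3, 2, 4] there are six pairs, of which five are concordant and one is discordant, giving τ = (5 - 1) / 6.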
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency, and reverts to the Pearson correlation coefficient in the case of a bivariate normal input distribution. Extensive documentation is available from the Phik project.
Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use the bias-corrected measure proposed by Bergsma in 2013.
First rows
| | ID | Office_PIN | Application_Receipt_Date | Applicant_City_PIN | Applicant_Gender | Applicant_BirthDate | Applicant_Marital_Status | Applicant_Occupation | Applicant_Qualification | Manager_DOJ | Manager_Joining_Designation | Manager_Current_Designation | Manager_Grade | Manager_Status | Manager_Gender | Manager_DoB | Manager_Num_Application | Manager_Num_Coded | Manager_Business | Manager_Num_Products | Manager_Business2 | Manager_Num_Products2 | Business_Sourced |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | FIN1000001 | 842001 | 4/16/2007 | 844120.0 | M | 12/19/1971 | M | Others | Graduate | 11/10/2005 | Level 1 | Level 2 | 3.0 | Confirmation | M | 2/17/1978 | 2.0 | 1.0 | 335249.0 | 28.0 | 335249.0 | 28.0 | 0 |
| 1 | FIN1000002 | 842001 | 4/16/2007 | 844111.0 | M | 2/17/1983 | S | Others | Class XII | 11/10/2005 | Level 1 | Level 2 | 3.0 | Confirmation | M | 2/17/1978 | 2.0 | 1.0 | 335249.0 | 28.0 | 335249.0 | 28.0 | 1 |
| 2 | FIN1000003 | 800001 | 4/16/2007 | 844101.0 | M | 1/16/1966 | M | Business | Class XII | 5/27/2006 | Level 1 | Level 1 | 2.0 | Confirmation | M | 3/3/1969 | 0.0 | 0.0 | 357184.0 | 24.0 | 357184.0 | 24.0 | 0 |
| 3 | FIN1000004 | 814112 | 4/16/2007 | 814112.0 | M | 2/3/1988 | S | Salaried | Class XII | 8/21/2003 | Level 1 | Level 3 | 4.0 | Confirmation | F | 8/14/1978 | 0.0 | 0.0 | 318356.0 | 22.0 | 318356.0 | 22.0 | 0 |
| 4 | FIN1000005 | 814112 | 4/16/2007 | 815351.0 | M | 7/4/1985 | M | Others | Class XII | 5/8/2006 | Level 1 | Level 1 | 2.0 | Confirmation | M | 2/7/1971 | 2.0 | 1.0 | 230402.0 | 17.0 | 230402.0 | 17.0 | 0 |
| 5 | FIN1000006 | 814112 | 4/16/2007 | 814114.0 | M | 3/23/1988 | S | Others | Class XII | 1/17/2006 | Level 1 | Level 1 | 2.0 | Confirmation | M | 2/20/1979 | 0.0 | 0.0 | 247118.0 | 24.0 | 247118.0 | 24.0 | 1 |
| 6 | FIN1000007 | 842001 | 4/16/2007 | 844118.0 | M | 2/5/1969 | M | Business | Class XII | 9/1/2003 | Level 1 | Level 1 | 2.0 | Confirmation | M | 5/28/1969 | 0.0 | 0.0 | 315119.0 | 27.0 | 315119.0 | 27.0 | 1 |
| 7 | FIN1000008 | 800001 | 4/16/2007 | 844103.0 | M | 1/28/1984 | M | Salaried | Class XII | 12/16/2006 | Level 1 | Level 1 | 2.0 | Confirmation | M | 1/7/1976 | 5.0 | 4.0 | 117358.0 | 9.0 | 117358.0 | 9.0 | 0 |
| 8 | FIN1000009 | 209625 | 4/16/2007 | 206451.0 | M | 1/8/1976 | M | Business | Graduate | 11/18/2004 | Level 1 | Level 2 | 3.0 | Confirmation | M | 3/7/1966 | 0.0 | 0.0 | 244028.0 | 17.0 | 244028.0 | 17.0 | 1 |
| 9 | FIN1000010 | 211001 | 4/16/2007 | 212218.0 | M | 2/3/1982 | M | Others | Class XII | 8/15/2002 | Level 1 | Level 3 | 4.0 | Confirmation | M | 11/14/1974 | 0.0 | 0.0 | 851557.0 | 39.0 | 851557.0 | 39.0 | 1 |
Last rows
| | ID | Office_PIN | Application_Receipt_Date | Applicant_City_PIN | Applicant_Gender | Applicant_BirthDate | Applicant_Marital_Status | Applicant_Occupation | Applicant_Qualification | Manager_DOJ | Manager_Joining_Designation | Manager_Current_Designation | Manager_Grade | Manager_Status | Manager_Gender | Manager_DoB | Manager_Num_Application | Manager_Num_Coded | Manager_Business | Manager_Num_Products | Manager_Business2 | Manager_Num_Products2 | Business_Sourced |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 9517 | FIN1009518 | 121002 | 7/1/2008 | 121005.0 | M | 11/4/1980 | S | NaN | Graduate | 6/2/2008 | Level 3 | Level 3 | 4.0 | Probation | M | 10/17/1973 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0 |
| 9518 | FIN1009519 | 121002 | 7/1/2008 | 121004.0 | M | 7/5/1981 | M | NaN | Graduate | 6/2/2008 | Level 3 | Level 3 | 4.0 | Probation | M | 10/17/1973 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0 |
| 9519 | FIN1009520 | 250001 | 7/1/2008 | 250004.0 | F | 12/3/1965 | M | NaN | Graduate | 6/28/2006 | Level 1 | Level 2 | 3.0 | Confirmation | M | 6/3/1974 | 1.0 | 1.0 | 55000.0 | 2.0 | 55000.0 | 2.0 | 0 |
| 9520 | FIN1009521 | 753012 | 7/1/2008 | 754031.0 | M | 4/26/1984 | S | NaN | Class XII | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 0 |
| 9521 | FIN1009522 | 814112 | 7/1/2008 | 816118.0 | M | 11/20/1969 | M | NaN | Class XII | 10/3/2006 | Level 1 | Level 1 | 2.0 | Confirmation | M | 9/26/1955 | 4.0 | 2.0 | 418339.0 | 13.0 | 418339.0 | 13.0 | 0 |
| 9522 | FIN1009523 | 160017 | 7/1/2008 | 160032.0 | M | 1/18/1970 | M | Salaried | Graduate | 5/5/2008 | Level 2 | Level 2 | 3.0 | Probation | M | 5/10/1967 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0 |
| 9523 | FIN1009524 | 848101 | 7/1/2008 | 848302.0 | M | 9/11/1956 | M | NaN | Graduate | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 0 |
| 9524 | FIN1009525 | 753012 | 7/1/2008 | 753014.0 | F | 8/7/1975 | M | Salaried | Graduate | 8/22/2006 | Level 2 | Level 2 | 3.0 | Confirmation | M | 7/22/1970 | 0.0 | 0.0 | 316126.0 | 9.0 | 305775.0 | 8.0 | 0 |
| 9525 | FIN1009526 | 575003 | 7/1/2008 | 571248.0 | M | 12/23/1986 | S | Salaried | Class XII | 6/5/2008 | Level 3 | Level 3 | 4.0 | Probation | M | 9/23/1976 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0 |
| 9526 | FIN1009527 | 411006 | 7/1/2008 | 411006.0 | F | 2/7/1976 | M | Others | Graduate | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 0 |